Exploiting Limited Access Distance of Ode Systems for Parallelism and Locality in Explicit Methods∗
نویسنده
چکیده
The solution of initial value problems of large systems of ordinary differential equations (ODEs) is computationally intensive and demands for efficient parallel solution techniques that take into account the complex architectures of modern parallel computer systems. This article discusses implementation techniques suitable for ODE systems with a special coupling structure, called limited access distance, which typically arises from the discretization of systems of partial differential equations (PDEs) by the method of lines. It describes how these techniques can be applied to different explicit ODE methods, namely embedded Runge–Kutta (RK) methods, iterated RK methods, extrapolation methods, and Adams–Bashforth (AB) methods. Runtime experiments performed on parallel computer systems with different architectures show that these techniques can significantly improve runtime and scalability. By example of Euler’s method it is demonstrated that these techniques can also be applied to devise high-performance GPU implementations.
منابع مشابه
Locality Optimized Shared-Memory Implementations of Iterated Runge-Kutta Methods
Iterated Runge-Kutta (IRK) methods are a class of explicit solution methods for initial value problems of ordinary differential equations (ODEs) which possess a considerable potential for parallelism across the method and the ODE system. In this paper, we consider the sequential and parallel implementation of IRK methods with the main focus on the optimization of the locality behavior. We intro...
متن کاملEffiziente Implementierung eingebetteter Runge-Kutta-Verfahren durch Ausnutzung der Speicherzugriffslokalität
Embedded Runge-Kutta methods are among themost popular numerical solutionmethods for non-stiff initial value problems of ordinary differential equations. While possessing a simple computational structure, they provide desirable numerical properties and can adapt the step size efficiently. Therefore, embedded Runge-Kutta methods can often compute the solution function faster than other solution ...
متن کاملExploiting Locality and Parallelism in Pointer-based Programs
While powerful optimization techniques are currently available for limited automatic compilation domains, such as dense array-based scientific and engineering numerical codes, a similar level of success has eluded general-purpose programs, specially symbolic and pointer-based codes. Current compilers are not able to successfully deal with parallelism in those codes. Based on our previously deve...
متن کاملExploiting Data Locality on Scalable
OpenMP ooers a high-level interface for parallel programming on scalable shared memory (SMP) architectures providing the user with simple work-sharing directives while relying on the compiler to generate parallel programs based on thread parallelism. However, the lack of language features for exploiting data locality often results in poor performance since the non-uniform memory access times on...
متن کاملIntegrated technique for the optimization of healthcare facility problem
Access to basic health care services is one of the major issues in developing nations. The location of health care facilities is an important aspect in health care delivery. It is therefore necessary for health care services to be located optimally to serve the demands well. This study develops an integrated methodology in the field of healthcare location which minimizes the weighted average Eu...
متن کامل